Model Selection

Multimodal training

# Multimodal training

Mobileclip S2 Timm

MobileCLIP-S2 is an efficient image-text model that achieves rapid inference through multimodal reinforcement training, delivering outstanding zero-shot performance while maintaining a compact size.

Mobileclip S0 Timm

MobileCLIP-S0 is an efficient image-text model achieved through multimodal reinforcement training, significantly improving speed and size efficiency while maintaining high performance.

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase